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Improvements to gra ph colorin g registe r allocation 
Preston Briggs, Keith D. Cooper, Linda Torczon 

May 1994 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 16 Issue 3 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citin gs, index 
terms, review 



Full text available: Pdf f2.00 MB ) 



We describe two improvements to Chaitin-style graph coloring register allocators. The 
first, optimistic coloring, uses a stronger heuristic to find a k-coloring for the interference 
graph. The second extends Chaitin's treatment of rematerialization to handle a larger 
class of values. These techniques are complementary. Optimistic coloring decreases the 
number of procedures that require spill code and reduces the amount of spill code when 
sp ... 



Keywords: code generation, graph coloring, register allocation 



^ O ptimiz ing document format: Two diet plans for fat PDF 
Thomas A. Phelps, Robert Wilensky 
^ November 2003 Proceedings of the 2003 ACM symposium on Document engineering 

Publisher: ACM Press 

Full text available: pdf ( 198.98 KB) Additional Information: full citation , abstract , references , index terms 



As Adobe's Portable Document Format has exploded in popularity so too has the number 
PDF generators, and predictably the quality of generated PDF varies considerably. This 
paper surveys a range of PDF optimizations for space, and reports the results of a tool 
that can postprocess existing PDFs to reduce file sizes by 20 to 70% for large classes of 
PDFs. (Further reduction can often be obtained by recoding images to lower resolutions or 
with newer compression methods such as JBIG2 or JPEG2000, bu ... 

Keywords: PDF, compact PDF, compression, multivalent 



Scalable feature selectio n, classification and si g nature g eneration for or ganizing 

larg e text databases into hierarchica l to pic taxonomi es 

Soumen Chakrabarti, Byron Dom, Rakesh Agrawal, Prabhakar Raghavan 

August 1998 The VLDB Journal — The International Journal on Very Large Data 

Bases, volume 7 Issue 3 

Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf( 281.37 KB ) Additional Information: full cita tion, abstract , citin gs, index ternns 

We explore how to organize large text databases hierarchically by topic to aid better 
searching, browsing and filtering. Many corpora, such as internet directories, digital 
libraries, and patent databases are manually organized into topic hierarchies, also called 
taxonomies. Similar to indices for relational data, taxonomies make search and access 



more efficient. However, the exponential growth in the volume of on-line textual 
information makes it nearly impossible to maintain such taxono ... 



The strin g B-tree: a new data structure for strin g search in ext ernal memor y and its 
ap plications 

Paolo Ferragina, Roberto Grossi 

March 1999 Journal of the ACM (JACM), Volume 46 issue 2 
Publisher: ACM Press 

Additional Information: full citation, abstract, refere nces. cLtLngs, index 



Full text available: m pdf(36137KB) 

terms 

We introduce a new text-indexing data structure, the String B-Tree, that can be seen as a 
link between some traditional external-memory and string-matching data structures. In a 
short phrase, it is a combination of B-trees and Patricia tries for internal-node indices that 
is made more effective by adding extra pointers to speed up search and update 
operations. Consequently, the String B-Tree overcomes the theoretical limitations of 
inverted files, B-trees, prefix B-trees, s ... 

Keywords: B-tree, Patricia trie, external-memory data structure, prefix and range 
search, string searching and sorting, suffix array, suffix tree, text index 



Extractin g sentence se g ments for text summarization: a machine learnin g ap proach 
Wesley T. Chuang, Jihoon Yang 

July 2000 Proceedings of the 23rd annual international ACM SIGIR conference on 
Research and development in information retrieval 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citin gs, index 



Full text available: pdff 945.26 KB) 

'* terms 

With the proliferation of the Internet and the huge amount of data it transfers, text 
summarization is becoming more important. We present an approach to the design of an 
automatic text summarizer that generates a summary by extracting sentence segments. 
First, sentences are broken into segments by special cue markers. Each segment is 
represented by a set of predefined features (e.g. location of the segment, average term 
frequencies of the words occurring in the segment, number of title words ... 

Keywords: machine learning, sentence segment extraction, text summarization 



A patent search and classification system 
Leah S. Larkey 

August 1999 Proceedings of the fourth ACM conference on Digital libraries 
Publisher: ACM Press 

Full text available: W\ pdf (164.37 KB ) Additional Information: full citation , references , citin gs, index terms 



Keywords: applications, classification, digital libraries, information retrieval, patents, 
systems, text categorization 



Machine learnin g in automated text cate g orization 
Fabrizio Sebastiani 

March 2002 ACM Computing Surveys (CSUR), Volume 34 issue 1 
Publisher: ACM Press 

Additional Information: full citation , abstra ct, references , citings, index 



Full text available: TO pdff 524.41 KB ) 

t erms 

The automated categorization (or classification) of texts into predefined categories has 
witnessed a booming interest in the last 10 years, due to the increased availability of 
documents in digital form and the ensuing need to organize them. In the research 
community the dominant approach to this problem is based on machine learning 
techniques: a general inductive process automatically builds a classifier by learning, from 



a set of preclassified documents, the characteristics of the categories. 
Keywords: Machine learning, text categorization, text classification 



^ Im provin g the representation of le g al case texts with information extraction nnethods 
% Stefanie Bruninghaus, Kevin D. Ashley 

May 2001 Proceedings of the 8th international conference on Artificial intelligence 

and law 
Publisher: ACM Press 

Additional Information: f ull cita tion, abstract , refer ences , citin gs, index 



Full text available; TO pdf(188.30 KB) 

t erms 

The prohibitive cost of assigning indices to textual cases is a major obstacle for the 
practical use of AI and Law systems supporting reasoning and arguing with cases. While 
progress has been made toward extracting certain facts from well-structured case texts or 
classifying case abstracts under Key Number concepts, these methods still do not suffice 
for the complexity of indexing concepts in CBR systems. 

In this paper, we lay out how a better example representation may facilitate cla ... 

^ Te chniq ue for automatically correcting words in text 
ifc:, Karen Kukich 

December 1992 ACM Computing Surveys (CSUR), volume 24 issue 4 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings, index 



Full text available: TO pdf(6. 23 MB ) 

terms , review 

Research aimed at correcting words In text has focused on three progressively more 
difficult problems:(l) nonword error detection; (2) isolated-word error correction; and (3) 
context-dependent work correction. In response to the first problem, efficient pattern- 
matching and n-gram analysis techniques have been developed for detecting strings that 
do not appear in a given word list. In response to the second problem, a variety of 
general and application-specific spelling cor ... 

Keywords: n-gram analysis, Optical Character Recognition (OCR), context-dependent 
spelling correction, grammar checl<ing, natural-language-processing models, neural net 
classifiers, spell checking, spelling error detection, spelling error patterns, statistical- 
language models, word recognition and correction 



0 The FINITE STRING Newsletter: Abstracts of current literature 
Computational Linguistics Staff 

January 1987 Computational Linguistics, Volume i3 issue 1-2 
Publisher: MIT Press 

Full text available: ^^^^ ^.r^^(Sl 

'g].txjf(6.15 MBL^J Additional Information: full citation 

Publisher Site 



^ Interactive Editin g S ystems: Part II 
^ Norman i^eyrowitz, Andries van Dam 

^ September 1982 ACM Computing Surveys (CSUR), Volume 14 issue 3 
Publisher: ACM Press 

Full text available: ^ pdf f 9.17 MB) Additional Information: full citation , references , citings, index terms 



^ Data structures for efficient brok er i mplem entation 
/iikv Anthony Tomasic, Luis Gravano, Calvin Lue, Peter Schwarz, Laura Haas 
^ July 1997 ACM Transactions on Information Systems (TOIS), volume is issue 3 

Publisher: ACM Press 



Full text available: pdff31 6.45 KB ) Additional Infornnation: full citation. abstra.ct , reference s, citings, index 

terms , review 

With the profusion of text databases on the Internet, it is becoming increasingly hard to 
find the most useful databases for a given query. To attack this problem, several existing 
and proposed systems employ brokers to direct user queries, using a local database of 
summary information about the available databases. This summary information must 
effectively distinguish relevant databases and must be compact while allowing efficient 
access. We offer evidence that one broker, GIOSS 

Keywords: GIOSS, broker architecture, broker performance, distributed information, grid 
files, partitioned hashing 



^ ^ Enha nced h ypertext categoriz ation using hyperlinks 
Soumen Chakrabarti, Byron Dom, Piotr Indyk 

June 1998 ACM SIGMOD Record , Proceedings of the 1998 ACM SIGMOD international 

conference on Management of data SIGMOD '98, volume 27 issue 2 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citin gs, index 



Full text available: ri Pdf( 1.91 MB) 

terms 

A major challenge In indexing unstructured hypertext databases Is to automatically 
extract meta-data that enables structured search using topic taxonomies, circumvents 
keyword ambiguity, and improves the quality of search and profile-based routing and 
filtering. Therefore, an accurate classifier is an essential component of a hypertext 
database. Hyperlinks pose new problems not addressed in the extensive text classification 
literature. Links clearly contain high-quality semantic clues that ... 

Inverted files versus si g nature files for text indexin g 
Justin Zobel, Allstair Moffat, Kotagiri Ramamohanarao 

December 1998 ACM Transactions on Database Systems (TODS), volume 23 issue 4 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citin gs, index 



Full text available: W pdff2 43.62 KB ) 

terms 

Two well-known indexing methods are inverted files and signature files. We have 
undertaken a detailed comparison of these two approaches in the context of text 
indexing, paying particular attention to query evaluation speed and space requirements. 
We have examined their relative performance using both experimentation and a refined 
approach to modeling of signature files, and demonstrate that inverted files are distinctly 
superior to signature files. Not only can inverted files be used to ev ... 

Keywords: indexing, inverted files, performance, signature files, text databases, text 
indexing 



Information retrieval on the web 
Mel Kobayashi, Koichi Takeda 

June 2000 ACM Computing Surveys (CSUR), volume 32 issue 2 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citin gs, index 



Full text available: W Qdf(21 3.89 K B) 

terms 

In this paper we review studies of the growth of the Internet and technologies that are 
useful for information search and retrieval on the Web. We present data on the Internet 
from several different sources, e.g., current as well as projected number of users, hosts, 
and Web sites. Although numerical figures vary, overall trends cited by the sources are 
consistent and point to exponential growth in the past and in the coming decade. Hence it 
is not surprising that about 85% of Internet user ... 

Keywords: Internet, World Wide Web, clustering, indexing, information retrieval, 
knowledge management, search engine 



^ Tree inductio n vs. log istic regression: a learnin g-curve analysis 



Claudia Perlich, Foster Provost; Jeffrey S. Simonoff 

December 2003 The Journal of Machine Learning Research, volume 4 

Publisher: MIT Press 

Additional Information: full citation , abstract , references , citings, index 



Full text available: W pdf( 263.37 KB) 

^ terms 

Tree induction and logistic regression are two standard, off-the-shelf methods for building 
models for classification. We present a large-scale experimental comparison of logistic 
regression and tree induction, assessing classification accuracy and the quality of rankings 
based on class-membership probabilities. We use a learning-curve analysis to examine 
the relationship of these measures to the size of the training set. The results of the study 
show several things. (1) Contrary to some prior o ... 

Na rratives and Literar y Hy pertext: Readin g and writin g fluid H y pertext Narratives 
Polle T. Zellweger, Anne Mangen, Paula Newman 

June 2002 Proceedings of the thirteenth ACM conference on Hypertext and 

hypermedia 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citin gs, index 



Full text available: W pdf( 417.30 KB ) 

terms , review 

We describe a new way to present and author hypertext narratives. The Fluid Reader 
constructs a unified interactive text from the content of multiple nodes and allows a 
reader to explore alternative paths within it. The Fluid Reader has been available as a 
hands-on museum exhibit for nearly a year to date, where it has been enjoyed by readers 
of all ages. Its success has prompted further interest and development in Fluid 
hypertexts. We have designed and implemented an authoring tool called the F ... 

Keywords: authoring, fluid documents, fluid hypertext, fluid reader, fluid writer, 
hypertext narrative, stretchtext, treetable, visualization 



s S pecial issue on knowled g e representation 
Ronald J. Brachman, Brian C. Smith 
February 1980 ACM SIGART Bulletin, issue 70 

Publisher: ACM Press 

Full text available: pdf f13.13 MB ) Additional Information: full citation , abstract 

In the fall of 1978 we decided to produce a special issue of the SIGART Newsletter 
devoted to a survey of current knowledge representation research. We felt that there 
were twe useful functions such an issue could serve. First, we hoped to elicit a clear 
picture of how people working in this subdiscipline understand knowledge representation 
research, to illuminate the issues on which current research is focused, and to catalogue 
what approaches and techniques are currently being developed. Secon ... 

9 KM-I (k novuled qe management): clu sterin g I: Hierarchical document cate g orizatio n 

k v\/ith su p port vector machines 
Lijuan Cai, Thomas Hofmann 

November 2004 Proceedings of the thirteenth ACM international conference on 
Information and knowledge management CIKM '04 

Publisher: ACM Press 

Full text available: pdf ( 248.71 KB) Additional information: full citation , abstract , references , index terms 

Automatically categorizing documents into pre-defined topic hierarchies or taxonomies is 
a crucial step in knowledge and content management. Standard machine learning 
techniques like Support Vector Machines and related large margin methods have been 
successfully applied for this task, albeit the fact that they ignore the inter-class 
relationships. In this paper, we propose a novel hierarchical classification method that 
generalizes Support Vector Machine learning and that is based on discrlmi ... 

Keywords: SVM, class relationship, document categorization, hierarchical loss, subspace 
optimization, taxonomy 



Alg orit hm s a nd data structure s for fl ash m emories 
Eran Gal, Sivan Toledo 
^ June 2005 ACM Computing Surveys (CSUR), volume 37 issue 2 

Publisher: ACM Press 

Full text available: Q pdf(34 3.39 KB ) Additional Information: full citation, abstract, references , index terms 

Flash memory is a type of electrically-erasable programmable read-only memory 
(EEPROM). Because flash memories are nonvolatile and relatively dense, they are now 
used to store files and other persistent objects in handheld computers, mobile phones, 
digital cameras, portable music players, and many other computer systems in which 
magnetic disks are inappropriate. Flash, like earlier EEPROM devices, suffers from two 
limitations. First, bits can only be cleared by erasing a large block of memory. S ... 

Keywords: EEPROM memory. Flash memory, wear leveling 
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S25 


20 


S22 AND AC=US AND AY= (1970 : 2001) /PR 


S26 


13 


S22 AND- PY=1970:2001 


S27 


25 


S23 :S26 


S28 


1649 


S1(5N) (MATRIX OR MATRICE? ? OR EVALUAT?) 


S29 


16 


S28 AND PATENT? ? 


S30 


15 


S29 NOT (S21 OR S2:S3 OR S8 OR Sll OR S15 OR S17) 


S31 


70 


SI (5N) CHART??? 


S32 


3 


S31 AND PATENT? ? 


S33 


455 


SI (5N) CHECK??? 


S34 


4 


S3 3 AND PATENT? ? 


S35 


3 


S34 NOT S32 



File 275:Gale Group Computer DB{™) 1983-2006/Jan 06 

(c) 2 0Q6 The Gale Group 
File 621:Gale Group New Prod.Annou. (R) 1985-2006/Jan 06 

(c) 2 006 The Gale Group 
File 636:Gale Group Newsletter DB(TM) 1987-2006/Jan 05 

(c) 2006 The Gale Group 
File 16:Gale Group PROMT(R) 1990-2006/ Jan 06 

(c) 2006 The Gale Group 
File 160:Gale Group PROMT (R) 1972-1989 

(c) 1999 The Gale Group 
File 148:Gale Group Tr^.de & Industry DB 1976-2 006/Jan 06 

(c)2006 The Gale Group 
File 624 :McGraw-Hill Publications 1985-2006/Jan 06 

(c) 2006 McGraw-Hill Co. Inc 
File 15 :ABI/Inform(R) 1971-2006/Jan 06 

(c) 2006 ProQuest Inf o&Learning 
File 647: CMP Computer Fulltext 1988-2006/ Jan W2 

(c) 2006 CMP Media, LLC 
File 674: Computer News Fulltext 1989-2005/Oct W2 

(c) 2005 IDG Communications 
File 696:DIALOG Telecom. Newsletters 1995-2006/Jan 05 

(c) 2006 Dialog 
File 369:New Scientist 1994-2005/Aug W2 

(c) 2005 Reed Business Information Ltd. 
File 810:Business Wire 1986-1999/Feb 28 

(c) 1999 Business Wire 
File 813: PR Newswire 1987-1999/Apr 30 

(c) 1999 PR Newswire Association Inc 
File 610:Business Wire 1999-2006/Jan 06 

(c) 2006 Business Wire. 
File 613: PR Newswire 1999-2006/Jan 06 

(c) 2006 PR Newswire Association Inc 

Set Items Description 

51 2068582 CLAIM? ? 

52 52729 S1(5N) (DEPENDENC? ?? OR RELATIONSHIP? ? OR REFER??? OR REFE- 

RENC??? OR ASSOCIAT???? OR LINK???? OR CONNECTION? ?) 

53 42377 S1(5N) (ORIENT?????? OR 0RGANI2? OR ORGANIS? OR ARRANG?????? 

OR POSITION??? OR STRUCTUR??? OR MAP???? OR CONFIGUR?) 

54 74453 S1(5N) (GRAPHIC? OR PICTURE? ? OR PICTORIAL OR VISUAL? OR R- 

EPRESENT? OR DISPLAY??? OR DEPICT? OR DIAGRAM??? OR ANALYZ? OR 
ANALYS? OR MATRIX OR MATRICE? ? OR EVALUAT? OR CHART??? OR C- 
HECK???) 

55 552891 TREE? ? OR HIERARCH? OR PARENT () CHILD 

56 383828 (SI OR PATENT? ?) {5N) (AUTHOR??? OR DRAFT??? OR WRIT???? OR 

GREAT???. OR GENERAT??? OR CONSTRUCT??? OR FORM OR FORM? ? OR - 
FORMED OR FORMING OR FORMATION OR DEVELOP??? OR BUILD??? OR B- 
UILT OR PRODUC???? OR COMPOS?????) 

57 2010 S1(10N)S5 



S8 


28 


S2 :S4 (50N) S7 (50N) S6 


S9 


19 


RD (unique items) 


SIO 


2418 


S2 :S4 (50N) PATENT? ?(50N)S6 


Sll 


32175 


S1(5N) (GRAPHIC? OR PICTURE? ? OR PICTORIAL OR VISUAL? OR R- 




EPRESENT? OR DISPLAY??? OR DEPICT? OR DIAGRAM??? OR MATRIX OR 




MATRICE? ?) 


S12 


11 


S2 :S3 (50N) S4 (50N) Sll (50N) S6 (50N) PATENT? ? 


S13 


8 


RD (unique items ) 


S14 


2814 


Sll (50N)S6 


S15 


197 


Sll (50N) S6 (SON) PATENT? ? 


S16 


117 


RD (unique items ) 


S17 


109 


S16 NOT (S9 OR S13) 


S18 


185490 


PATENT? ?(5N) (AUTHOR??? OR DRAFT??? OR WRIT???? OR GREAT??? 



OR GENERAT??? OR CONSTRUCT??? OR FORM OR FORM? ? OR FORMED OR 
FORMING OR FORMATION OR DEVELOP??? OR BUILD??? OR BUILT OR P- 
RODUC???? OR COMPOS?????) 



519 269 CLAIM? ?(3N)TREE 

520 • 0 S18 AND S19 

521 1 S19 (SON) PATENT? ? 

522 898 Sll (3 ON) PATENT? ? 

523 351 SI (5N)HIERARCH? 

524 7 S23 (SON) PATENT? ? 

525 5 RD (unique items) 



File 8:Ei Compendex{R) 1970-2006/Dec W4 

(c) 2006 Elsevier Eng. Info. Inc. 
File 35 :Dissertation Abs Online 1861-2005/DeG 

(c) 2005 ProQuest Inf o&Learning 
File 65: Inside Conferences 1993 -2006/Jan Wl 

(c) 2006 BLDSC all rts . reserv. 
File 2:INSPEC 18 98r 2 005/Dec W2 

(c) 2005 Institution of Electrical Engineers 
File 94 : JICST-EPlus 1985-2006/Oct W5 

(c)2006 Japan Science and Tech Corp ( JST) 
File 6:NTIS 1964 -2005/Dec W2 

(c) 2005 NTIS, Intl Cpyrght All Rights Res 
File 144: Pascal 1973 -2005/Dec W2 

(c) 2005 INIST/CNRS 
File 434 :SciSearch{R) Cited Ref Sci 1974-1989/Dec 

(c) 1998 Inst for Sci Info 
File 34 :SciSearch(R) Cited Ref Sci 1990-2006/ Jan Wl 

(c) 2006 Inst for Sci Info 
File 99: Wilson Appl . Sci & Tech Abs 1983 -2005/Oct 

(c) 2005 The HW Wilson Co. 
File 266:FEDRIP 2005/Dec 

Comp & dist by NTIS, Intl Copyright All Rights Res 
File 95 rTEME- Technology & Management 1989-2006/Jan Wl 

(c) 2006 FIZ TECHNIK 
File 583:Gale Group Globalbase .(TM) 1986-2002/Dec 13 

(c) 2 002 The Gale Group 
File 483 :Newspaper Abs Daily 1986-2006/Jan 05 

(c) 2006 ProQuest Inf o&Learning 
File 256 :TecInfoSource 82-2005/Feb 

(c) 2005 Info. Sources Inc 



Set Items Description 

51 289323 CLAIM? ? 

52 5406 S1(5N) (DEPENDENC?? ? OR RELATIONSHIP? ? OR REFER??? OR REFE- 

RENC??? OR ASSOCIAT???? OR LINK???? OR CONNECTION? ?) 

53 4198 S1(5N) (ORIENT?????? OR ORGANIZ? OR ORGANIS? OR ARRANG?????? 

OR POSITION??? OR STRUCTUR??? OR MAP???? OR CONFIGUR?) 

54 6657 S1(5N) (ANALYZ? OR ANALYS? OR EVALUAT? OR CHART??? OR CHECK- 

??? ) 

55 3078 S1(5N) (GRAPHIC? OR PICTURE? ? OR PICTORIAL OR VISUAL? OR R- 

EPRESENT? OR DISPLAY??? OR DEPICT? OR DIAGR/UVI??? OR MATRIX OR 
MATRICE? ? OR CHART???) 

56 868406 TREE? ? OR HIERARCH? OR PARENT () CHILD 

57 33363 (SI OR PATENT? ?) (5N) (AUTHOR??? OR DRAFT??? OR WRIT???? OR 

GREAT??? OR GENERAT??? OR CONSTRUCT??? OR FORM OR FORM? ? OR - 
FORMED OR FORMING OR FORMATION OR DEVELOP??? OR BUILD??? OR B- 
UILT OR PRODUC???? OR COMPOS?????) 

58 1798 S2:S5 AND S7 

59 53 S8 AND PATENT? ? 

510 46 RD (unique items) 

511 12805 PATENT? ? (5N) (AUTHOR??? OR DRAFT??? OR WRIT???? OR GREAT??? 

OR GENERAT??? OR CONSTRUCT??? OR FORM OR FORM? ? OR FORMED OR 
FORMING OR FORMATION OR DEVELOP??? OR BUILD??? OR BUILT OR P- 
RODUC???? OR COMPOS?????) 

512 719 Sll AND SI 

513 1834 S11(5N) (APPLICATION? ? OR TOOL? ? OR SOFTWARE OR PROGRAM? - 

?) 

514 77 SI AND S13 ^ 

515 62 RD (unique items) \\\^ 

516 58 S15 NOT SIO 

517 235 S1{5N)S6 

518 0 S17 AND PATENT? ? N 



